Search CORE

Distinct, ecotype-specific genome and proteome signatures in the marine cyanobacteria Prochlorococcus

Author: Bag Sumit K
Das Sabyasachi
Dutta Anirban
Dutta Chitra
Paul Sandip
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

The marine cyanobacterium Prochlorococcus marinus, having multiple ecotypes of distinct genotypic/ phenotypic traits and being the first documented example of genome shrinkage in free-living organisms, offers an ideal system for studying niche-driven molecular micro-diversity in closely related microbes. The present study,through an extensive comparative analysis of various genomic/proteomic features of 6 high light (HL) and 6 low light (LL) adapted strains, makes an attempt to identify molecular determinants associated with their vertical niche partitioning. Pronounced strand-specific asymmetry in synonymous codon usage is observed exclusively in LL strains. Distinct dinucleotide abundance profiles are exhibited by 2 LL strains with larger genomes and G+C-content ≈ 50% (group LLa), 4 LL strains having reduced genomes and G+C-content ≈ 35-37% (group LLb), and 6 HL strains. Taking into account the emergence of LLa, LLb and HL strains (based on 16S rRNA phylogeny), a gradual increase in average aromaticity, pI values and beta- & coil-forming propensities and a decrease in mean hydrophobicity, instability indices and helix-forming propensities of core proteins are observed. Greater variations in orthologous gene repertoire are found between LLa and LLb strains, while higher number of positively selected genes exist between LL and HL strains. Strains of different Prochlorococcus groups are characterized by distinct compositional, physicochemical and structural traits that are not mere remnants of a continuous genetic drift, but are potential outcomes of a grand scheme of niche-oriented stepwise diversification, that might have driven them chronologically towards greater stability/fidelity and invoked upon them a special ability to inhabit diverse oceanic environments

Molecular signature of hypersaline adaptation: insights from genome and proteome composition of halophilic prokaryotes

Author: Bag Sumit K
Das Sabyasachi
Dutta Chitra
Harvill Eric T
Paul Sandip
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

A comparative genomic and proteomic study of halophilic and non-halophilic prokaryotes identifies specific genomic and proteomic features typical of halophilic species that are independent from genomic GC-content and taxonomic position

Comparative Transcriptional Profiling of Contrasting Rice Genotypes Shows Expression Differences during Arsenic Stress

Author: Archana Bhardwaj
Arti Rai
Bijan Adhikari
Debasis Chakrabarty
Doi K.
Prabodh K. Trivedi
Prashant Misra
Prestamo G.
Rudra D. Tripathi
Sumit K. Bag
Publication venue: 'Crop Science Society of America'
Publication date
Field of study

Public Library of Science (PLOS)

Oligonucleotide Frequencies of Barcoding Loci Can Discriminate Species across Kingdoms

Author: A Campbell
A Cywinska
Antariksh Tyagi
AV Borisenko
C Moritz
CA Machado
CP Meyer
D Lipscomb
D Rubinoff
D Rubinoff
D Tautz
E Marshall
E Mayr
E Pennisi
EHK Wong
ES Tavares
H Nakashima
H Nakashima
I Meusnier
IN Sarkar
J Rach
JD Thompson
K Tamura
KA Seifert
Klaus F. X. Mayer
KW Will
M Hajibabaei
M Hajibabaei
M Hajibabaei
M Takahashi
M Vences
PDN Hebert
PDN Hebert
PDN Hebert
R DeSalle
R Lahaye
Rakesh Tuli
RR Hudson
S Karlin
S Karlin
S Karlin
S Karlin
S Karlin
SG Newmaster
Sribash Roy
Sumit K. Bag
T Abe
V Savolainen
Virendra Shukla
WJ Kress
WJ Kress
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background: DNA barcoding refers to the use of short DNA sequences for rapid identification of species. Genetic distance or character attributes of a particular barcode locus discriminate the species. We report an efficient approach to analyze short sequence data for discrimination between species. Methodology and Principal Findings: A new approach, Oligonucleotide Frequency Range (OFR) of barcode loci for species discrimination is proposed. OFR of the loci that discriminates between species was characteristic of a species, i.e., the maxima and minima within a species did not overlap with that of other species. We compared the species resolution ability of different barcode loci using p-distance, Euclidean distance of oligonucleotide frequencies, nucleotide-character based approach and OFR method. The species resolution by OFR was either higher or comparable to the other methods. A short fragment of 126 bp of internal transcribed spacer region in ribosomal RNA gene was sufficient to discriminate a majority of the species using OFR. Conclusions/Significance: Oligonucleotide frequency range of a barcode locus can discriminate between species. Ability to discriminate species using very short DNA fragments may have wider applications in forensic and conservation studies

CiteSeerX

Directory of Open Access Journals

Molecular association of glucose-6- phosphate isomerase and pyruvate kinase M2 with glyceraldehyde-3-phosphate dehydrogenase in cancer cells

Author: A Sawa
Alok Ghosh
Arup K. Bag
AT Cordeiro
B Khatua
B You
C Tristan
Chitra Mandal
D Anastasiou
D Kozakov
D Kozakov
D Schneidman-Duhovny
D Wang
DE Epner
E Krissinel
EJ Tisdale
F Ferreira-da-Silva
H Nakajima
HM Berman
HR Christofk
JH Roe
JJ Laschet
JL Jenkins
K Tokunaga
M Cortés-Cros
M Ray
M Torchala
Mahua R. Das
Manju Ray
N Schek
O Warburg
PA Saunders
PE Glaser
Provas Das
Q Huang
Q Zhang
RA Gomes
RE Ferguson
S Bagui
S Carujo
S Mazurek
S Patra
S Ray
S Saha
S Saha
Saikat Chakrabarti
Shekhar Saha
Siddhartha S. Jana
Subhankar Ray
Sumit K. Dey
T Funasaka
T Funasaka
T Li
T Noguchi
V Ravichandran
W Luo
W Yang
W Yang
X Gao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: For a long time cancer cells are known for increased uptake of glucose and its metabolization through glycolysis. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is a key regulatory enzyme of this pathway and can produce ATP through oxidative level of phosphorylation. Previously, we reported that GAPDH purified from a variety of malignant tissues, but not from normal tissues, was strongly inactivated by a normal metabolite, methylglyoxal (MG).Molecular mechanism behind MG mediated GAPDH inhibition in cancer cells is not well understood. Methods: GAPDH was purified from Ehrlich ascites carcinoma (EAC) cells based on its enzymatic activity. GAPDH associated proteins in EAC cells and 3-methylcholanthrene (3MC) induced mouse tumor tissue were detected by mass spectrometry analysis and immunoprecipitation (IP) experiment, respectively. Interacting domains of GAPDH and its associated proteins were assessed by in silico molecular docking analysis. Mechanism of MG mediated GAPDH inactivation in cancer cells was evaluated by measuring enzyme activity, Circular dichroism (CD) spectroscopy, IP and mass spectrometry analyses. Result: Here, we report that GAPDH is associated with glucose-6-phosphate isomerase (GPI) and pyruvate kinase M2 (PKM2) in Ehrlich ascites carcinoma (EAC) cells and also in 3-methylcholanthrene (3MC) induced mouse tumor tissue. Molecular docking analyses suggest C-terminal domain preference for the interaction between GAPDH and GPI. However, both C and N termini of PKM2 might be interacting with the C terminal domain of GAPDH. Expression of both PKM2 and GPI is increased in 3MC induced tumor compared with the normal tissue. In presence of 1 mM MG,association of GAPDH with PKM2 or GPI is not perturbed, but the enzymatic activity of GAPDH is reduced to 26.8 ± 5 % in 3MC induced tumor and 57.8 ± 2.3 % in EAC cells. Treatment of MG to purified GAPDH complex leads to glycation at R399 residue of PKM2 only, and changes the secondary structure of the protein complex. Conclusion: PKM2 may regulate the enzymatic activity of GAPDH. Increased enzymatic activity of GAPDH in tumor cells may be attributed to its association with PKM2 and GPI. Association of GAPDH with PKM2 and GPI could be a signature for cancer cells. Glycation at R399 of PKM2 and changes in the secondary structure of GAPDH complex could be one of the mechanisms by which GAPDH activity is inhibited in tumor cells by MG

Transcriptomic and metabolomic shifts in rice roots in response to Cr (VI) stress

Abstract Background Widespread use of chromium (Cr) contaminated fields due to careless and inappropriate management practices of effluent discharge, mostly from industries related to metallurgy, electroplating, production of paints and pigments, tanning, and wood preservation elevates its concentration in surface soil and eventually into rice plants and grains. In spite of many previous studies having been conducted on the effects of chromium stress, the precise molecular mechanisms related to both the effects of chromium phytotoxicity, the defense reactions of plants against chromium exposure as well as translocation and accumulation in rice remain poorly understood. Results Detailed analysis of genome-wide transcriptome profiling in rice root is reported here, following Cr-plant interaction. Such studies are important for the identification of genes responsible for tolerance, accumulation and defense response in plants with respect to Cr stress. Rice root metabolome analysis was also carried out to relate differential transcriptome data to biological processes affected by Cr (VI) stress in rice. To check whether the Cr-specific motifs were indeed significantly over represented in the promoter regions of Cr-responsive genes, occurrence of these motifs in whole genome sequence was carried out. In the background of whole genome, the lift value for these 14 and 13 motifs was significantly high in the test dataset. Though no functional role has been assigned to any of the motifs, but all of these are present as promoter motifs in the Database of orthologus promoters. Conclusion These findings clearly suggest that a complex network of regulatory pathways modulates Cr-response of rice. The integrated matrix of both transcriptome and metabolome data after suitable normalization and initial calculations provided us a visual picture of the correlations between components. Predominance of different motifs in the subsets of genes suggests the involvement of motif-specific transcription modulating proteins in Cr stress response of rice.</p

Directory of Open Access Journals

Public Library of Science (PLOS)

Universal Plant DNA Barcode Loci May Not Work in Complex Groups: A Case Study with Indian Berberis Species

Author: AJ Fazekas
AM Abdalla
AN Schmidt-Lebuhn
AN Schmidt-Lebuhn
Anil Kumar
Antariksh Tyagi
Bhaskar Datt
CBOL
CK Schneider
CP Meyer
D Edwards
D Rubinoff
D Tautz
DL Swofford
DM Spooner
DM Spooner
DV Jobes
E Pennisi
EHK Wong
EJH Corner
EJH Corner
F Hooker
F Miura
FG Nieto
FJ Rohlf
G King
GA Singer
GD Weiblen
GS Puri
I Alvarez
J Rach
J Shaw
J Waugh
JD Lubell
JD Lubell
JF Wendel
JF Wendel
JG Hawkes
JR Harlan
JR Starr
K Onodera
K Tamura
L Ahrendt
Lal Babu Chaudhary
LR Landrum
M Blaxter
M Vences
MA Smith
MC Orsi
MCJ Bottini
MCJ Bottini
MT Cervera
MW Chase
MW Chase
N Ronsted
Narayanan K. Nair
O Seberg
P Taberlet
PB Singh
PDN Hebert
PJ Ersts
Pradhyumna K. Singh
R Desalle
R Lahaye
Rakesh Tuli
RD Ward
RP Kelly
RR Rao
RS Dodd
S Chen
SG Newmaster
Simon Joly
Sribash Roy
Sumit K. Bag
T Gao
Tariq Husain
Uma M. Singh
Virendra Shukla
WB Storey
WJ Kress
WJ Kress
WJ Kress
Y-D Kim
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

BACKGROUND: The concept of DNA barcoding for species identification has gained considerable momentum in animals because of fairly successful species identification using cytochrome oxidase I (COI). In plants, matK and rbcL have been proposed as standard barcodes. However, barcoding in complex genera is a challenging task. METHODOLOGY AND PRINCIPAL FINDINGS: We investigated the species discriminatory power of four reportedly most promising plant DNA barcoding loci (one from nuclear genome--ITS, and three from plastid genome--trnH-psbA, rbcL and matK) in species of Indian Berberis L. (Berberidaceae) and two other genera, Ficus L. (Moraceae) and Gossypium L. (Malvaceae). Berberis species were delineated using morphological characters. These characters resulted in a well resolved species tree. Applying both nucleotide distance and nucleotide character-based approaches, we found that none of the loci, either singly or in combinations, could discriminate the species of Berberis. ITS resolved all the tested species of Ficus and Gossypium and trnH-psbA resolved 82% of the tested species in Ficus. The highly regarded matK and rbcL could not resolve all the species. Finally, we employed amplified fragment length polymorphism test in species of Berberis to determine their relationships. Using ten primer pair combinations in AFLP, the data demonstrated incomplete species resolution. Further, AFLP analysis showed that there was a tendency of the Berberis accessions to cluster according to their geographic origin rather than species affiliation. CONCLUSIONS/SIGNIFICANCE: We reconfirm the earlier reports that the concept of universal barcode in plants may not work in a number of genera. Our results also suggest that the matK and rbcL, recommended as universal barcode loci for plants, may not work in all the genera of land plants. Morphological, geographical and molecular data analyses of Indian species of Berberis suggest probable reticulate evolution and thus barcode markers may not work in this case

CiteSeerX

Directory of Open Access Journals

Design, Development and Implementation of Novel in silico Data Mining, Clustering and Visualization Tools for Comparative Genome and Proteome Analysis

Author: Bag Sumit K
Publication venue
Publication date: 01/01/2012
Field of study

If one recited the human genome sequence at the rate of 5 bases per second for 24 hours a day, it would take one about 19 years to recite the book of life. This is only taking into account 3 billion bases of the genome sequence itself, excluding data annotations and other information associated with sequence data. As of April 15, 2012, GenBank release 189.0 has 139,266,481,398 bases from 151,824,421 reported sequences - nearly 47 times the human genome sequence content (GenBank release note - ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt). The number of sequenced genomes is presently growing at an unprecedented pace and there should be no reason to expect it to slow down in the future. On the contrary, the introduction of genomics at a massive scale with „second-generation sequencing technologies‟ such as 454, Solexa or Solid adds to this expectation (Mardis 2008), as illustrated by the recent genome sequencing of a single individual. Importantly, these new technologies promise to be very useful for genome analysis in non-model organisms (Ellegren 2008; Hudson 2008; Vera et al. 2008). Moreover, the 1000 genomes project will create a new map of genetic variation for our genome. Other projects are helping to catalogue genes involved in cancer, alternative splicing in different tissues and transcription factor binding, for example. The evolution of „omic‟ science through microarray transcriptomics, metabolomics, proteomics, etc. is generating this huge data. This data have begun to revolutionize genomics and their effects are becoming increasingly widespread. There has been a vital need to warehouse, harness, disseminate, analyze and interpret this torrents of data, not only to quench the academic thirst of the researchers, but also for medical diagnostic and therapeutic uses and several other biotechnological applications. For this, use of computational approaches was inevitable and thus emerged Bioinformatics – a new field of scientific enquiry that represents a confluence of biology, computer science and information technology